Yixin Cao

NEX: Neuron Explore-Exploit Scoring for Label-Free Chain-of-Thought Selection and Model Ranking

Feb 05, 2026

Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents

Feb 02, 2026

CoDiQ: Test-Time Scaling for Controllable Difficult Question Generation

Feb 02, 2026

EMemBench: Interactive Benchmarking of Episodic Memory for VLM Agents

Jan 23, 2026

Thinking Traps in Long Chain-of-Thought: A Measurable Study and Trap-Aware Adaptive Restart

Jan 17, 2026

What Do LLM Agents Know About Their World? Task2Quiz: A Paradigm for Studying Environment Understanding

Jan 14, 2026

ARM: Role-Conditioned Neuron Transplantation for Training-Free Generalist LLM Agent Merging

Jan 12, 2026

SCALER: Synthetic Scalable Adaptive Learning Environment for Reasoning

Jan 08, 2026

Do LLMs Signal When They're Right? Evidence from Neuron Agreement

Oct 30, 2025

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Jul 08, 2025